Architectural and Software Support for Executing Numerical Applications on High Performance Computers
Authors
Abstract
Numerical applications require large amounts of computing power. Although shared memory multiprocessors provide a cost-effective platform for parallel execution of numerical programs, parallel processing has not delivered the expected performance on these machines. There are two crucial steps in parallel execution of numerical applications: (1) effective parallelization of an application and (2) efficient execution of the parallel program on a multiprocessor. This thesis addresses the second step within the scope of automatically parallelized FORTRAN programs.
In this thesis, the mismatch between the needs of parallelized FORTRAN programs and the support for parallel execution in shared memory multiprocessors is identified as a cause of poor performance. The thesis addresses this problem from two angles: architectural and software support for parallel execution, and compiler transformation to enhance program characteristics. Architectural features and synchronization and scheduling algorithms are studied to increase the efficiency of support for parallel execution. It is shown that architectures supporting atomic fetch&add primitives and synchronization busses can execute parallel programs more effectively. New algorithms for lock access and parallel task scheduling are proposed.
The thesis also explores compiler transformations which can modify parallel program characteristics to increase the parallel execution efficiency of less sophisticated architectures. It is shown that by using blocking transformations on nested parallel loops, program characteristics can be modified to decrease the need for scheduling and synchronization operations. This results in an increase in the efficiency of parallel execution, especially for multiprocessors with simplistic support for interprocessor synchronization.
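The relationship between fetch&add-based scheduling and loop granularity can be illustrated with a small sketch. This is not the thesis's algorithm or its blocking transformation on nested loops; it is a simplified, single-loop illustration under stated assumptions. The class `FetchAndAdd`, the functions `run_parallel_loop` and `demo`, and the block size of 50 are all hypothetical names and values chosen for the example, and a lock stands in for a hardware atomic fetch&add.

```python
import threading


class FetchAndAdd:
    """Shared counter. A lock emulates a hardware atomic fetch&add (assumption)."""

    def __init__(self):
        self._value = 0
        self._lock = threading.Lock()
        self.calls = 0  # number of synchronization operations issued

    def fetch_and_add(self, inc):
        # Atomically return the old value and advance the counter by inc.
        with self._lock:
            old = self._value
            self._value += inc
            self.calls += 1
            return old


def run_parallel_loop(n_iters, block, n_workers, body):
    """Self-schedule iterations 0..n_iters-1 in chunks of `block` iterations.

    Returns the number of fetch&add operations that were needed.
    """
    counter = FetchAndAdd()

    def worker():
        while True:
            start = counter.fetch_and_add(block)  # claim the next chunk
            if start >= n_iters:
                return  # no work left
            for i in range(start, min(start + block, n_iters)):
                body(i)

    threads = [threading.Thread(target=worker) for _ in range(n_workers)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return counter.calls


def demo(n_iters=1000, n_workers=4):
    out = [0] * n_iters

    def body(i):
        out[i] = i * i  # each iteration writes its own slot, so no race

    calls_unblocked = run_parallel_loop(n_iters, 1, n_workers, body)
    calls_blocked = run_parallel_loop(n_iters, 50, n_workers, body)
    return out, calls_unblocked, calls_blocked
```

In this sketch each worker issues one final fetch&add that finds no work, so the operation count is the number of chunks plus the number of workers. Grouping 50 iterations per chunk therefore cuts the scheduling operations substantially while computing the same result, which is the general effect the abstract attributes to coarser-grained parallel loops.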
Similar resources
Architectural Support for Managing Nomadic Applications and Services
In the increasingly wireless world of data communications, laptops, palmtops and other nomadic computers are appearing that have built-in, wireless access (e.g. cellular, radio, infrared, satellite access) to one or more wireless networks around them. Applications executing on these nomadic computers need to be able to locate services and enter into one or more service-level agreements with ser...
Interventions for Scientific and Enterprise Applications Based on High Performance and Cloud Computing
High performance computing refers to the practice of aggregating computing power in a way that delivers much higher performance than one could get out of a typical desktop computer in order to solve large problems in science, engineering or business. While cloud computing is a model for enabling convenient, on-demand network access to a shared pool of configurable computing resources (e.g., net...
Linear Algebra on High Performance Computers
This is a survey of some work recently done at Argonne National Laboratory in an attempt to discover ways to construct numerical software for high performance computers. The numerical algorithms discussed are taken from several areas of numerical linear algebra. We discuss certain architectural features of advanced computer architectures that will affect the design of algorithms. The technique ...
Computational Fluid Dynamics Simulation and Experimental Validation of Hydraulic Performance of a Vertical Suspended API Pump (RESEARCH NOTE)
For a long period of time, the design and manufacturing technology of high flow rate vertically suspended pumps (VSPs), which have extensive applications in many industries such as water and wastewater, mining, petrochemical, and oil and gas, was imported from European countries. For the first time in Iran's pump industry, with the support of Ministry of Petrochemical and ...
LNCS 7851 - High Performance Computing for Computational Science - VECPAR 2012
The development of an exascale computing capability with machines capable of executing O(10^18) operations per second by the end of the decade will be characterized by significant and dramatic changes in computing hardware architecture from current (2012) petascale high-performance computers. From the perspective of computational science, this will be at least as disruptive as the transition from ...
Publication date: 1993